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Abstract 

The principal problem in algorithmic mechanism design is in merging the incentive con- 
straints imposed by selfish behavior with the algorithmic constraints imposed by computational 
intractability. This field is motivated by the observation that the preeminent approach for de- 
signing incentive compatible mechanisms, namely that of Vickrey, Clarke, and Groves; and the 
central approach for circumventing computational obstacles, that of approximation algorithms, 
are fundamentally incompatible: natural applications of the VCG approach to an approxi- 
mation algorithm fails to yield an incentive compatible mechanism. We consider relaxing the 
desideratum of (ex post) incentive compatibility (IC) to Bayesian incentive compatibility (BIC), 
where truthtelling is a Bayes-Nash equilibrium (the standard notion of incentive compatibility 
in economics). For welfare maximization in single-parameter agent settings, we give a general 
black-box reduction that turns any approximation algorithm into a Bayesian incentive compat- 
ible mechanism with essentially the sam^ approximation factor. In other words, for a large 
relevant class of problems and slight relaxation of the standard (in computer science) solution 
concept, we completely solve the central problem in algorithm mechanism design. 



^More specifically, we obtain an FPTAS: for any e we take a /3-approximation algorithm and give a BIC (1 -|- e)/3- 
approximation mechanism. This (1 4- e) factor arises from statistical methods that seem necessary for a black-box 
reduction. 



1 Introduction 



Can any approximation algorithm he converted into an approximation mechanism for selfish agents ? 
This question is framed by a fundamental incompatibility between the standard economic approach 
for the design of mechanisms for selfish agents (the Vickrey- Clarke- Groves (VCG) mechanism) and 
the standard algorithmic approach for circumventing computational intractability (approximation 
algorithms). The conclusion from this incompatibility, driving much of the field of algorithmic 
mechanism design, is that incentive and algorithmic constraints must be dealt with simultaneously. 
For a large, important class of problems, we arrive at the opposite conclusion: there is a general 
approximation-preserving reduction from mechanism design to algorithm design! 

The goal of mechanism design is to construct the rules for a system of agents so that in the 
equilibrium of selfish agent behavior a desired objective is obtained. For settings of incomplete 
information the standard game theoretic equilibrium concept is Bayes-Nash equilibrium (BNE), 
which is defined by mutual best response when the prior distribution of agent payoffs is commonly 
known. The revelation principle |12j suggests that when looking for mechanisms with desirable 
Bayes-Nash equilibria, one must look no further than those with truthtelling as a Bayes-Nash 
equilibrium, also known as Bayesian incentive compatible (BIG) mechanisms. Almost all of the 
computer science literature has focused on the BIG subclass of ex post incentive compatible (IG) 
mechanisms where truthtelling is a dominant strategy. While IG is aesthetically appealing because 
it is compatible with worst-case-style results, it is not generally without loss! 

Bayesian mechanism design is very well understood in single-parameter agent settings, such as 
(for example) where each agent has a single private value for receiving a service (see, e.g, p^). For 
the single parameter setting it is known that a mechanism is BIG if and only if (a) the probability 
an agent is served (a.k.a. the allocation rule) is monotone non-decreasing in the agent's value for 
service, and (b) the expected payment (a.k.a. the payment rule) is identified in a particular way 
from the allocation rule |12j . Probabilities and expectations above are taken with respect to both 
the distribution of agent values and possible randomization in the mechanism. 

The main challenge in reducing BIG (or IG) mechanism design to algorithm design is that 
approximation algorithms do not generally have monotone allocation rules. Our reduction shows 
that in a Bayesian setting we can convert any non-monotone allocation rule into a monotone 
one without compromising its performance. The main technical observation that enables this 
reduction is that, in a Bayesian setting, we can focus on a single agent for whom the allocation rule 
is not monotone, apply a transformation that fixes this non-monotonicity (and weakly improves 
our objective), and no other agents are affected (in a Bayesian sense). Thus, we can apply the 
transformation independently to each agent. 

Our reduction is as follows: 

1. For each agent, identify intervals in which the agent's allocation rule is non-monotone. (This 
is a property of the distribution and can be done prior to considering any agent bids.) 

2. For each agent, if their bid falls in an (above identified) interval, redraw the agent's bid from 
the prior distribution conditioned on being within the interval. 

3. Run the approximation algorithm on the resulting bids and output its solution. 

Notice that under the assumption that the original values are drawn according to the common 
prior, the redrawing of values does not alter this prior. 
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Two items must be clarified. First, there are many ways one might attempt to monotonize 
a non-monotone function (i.e., choose intervals in step 1 of the reduction), and most of them are 
incompatible with mechanism design. To address this issue we develop a monotonizing technique for 
allocation rules, adapted from the a standard ironing procedure from the field of Bayesian optimal 
mechanism design (introduced by Myerson [12j for monotonizing virtual valuation functions, a 
fundamental construct for the objective of profit maximization). Second, we must also determine 
payments for our monotonized allocation rule. For this, a general approach of Archer et al. [2j 
suffices. 

Selecting intervals of non-monotonicity as described above requires knowing the allocation rule 
precisely. If our algorithm is only given as a black box, it is unlikely we will have this precision. 
Therefore, we describe a statistical sampling approach that discretizes the valuation space and 
estimates the allocation rule in this discrete space to a high level of accuracy. Because this approach 
may fail to notice and fix a non-monotonicity, to achieve a truly BIC mechanism we will have 
to take a convex combination of our mechanism with one that is blatantly monotone (but does 
not approximate our objective). Unsurprisingly, in polynomial time we can achieve arbitrarily 
precise estimates which means our loss from this convex combination can be made arbitrarily small 
(resulting in a fully polynomial time approximation scheme (FPTAS)). Alternatively, we could 
ignore these small non-monotonicities and the resulting mechanism will have truthtelling as an 
e-Bayes-Nash equilibrium where e can be made arbitrarily small. 

The Bayesian approach to mechanism design is relevant for two reasons. The first, as described 
above, is that it is the standard solution concept in settings of incomplete information. The second 
is that for many relevant settings the worst-case approximation factor of the best algorithm is 
so bad that it is completely unrealistic to believe that a designer would be happy with such an 
approximation H For instance, for single-minded combinatorial auctions of m items, the optimal 
worst-case approximation factor (under standard complexity theoretic assumptions) is ^/m 
However, with relevant structure on the desired bundles and a distribution over valuations, it is 
possible to do much better. This motivates the consideration of the algorithmic problem of Bayesian 
(a.k.a. stochastic) approximation. Given a distribution over instances, the Bayesian approximation 
factor of the algorithm is the ratio of the expected welfare of the optimal allocation to the expected 
welfare of the algorithm. Notice that the class of worst-case /3-approximations is a subclass of 
the the class of Bayesian /^-approximations. All of our results apply to this more general class of 
Bayesian /3-approximations algorithms. 

For Internet settings for mechanism design such as online auctions (eBay), advertising auctions 
(Google, Yahoo!, MSN), file sharing (BitTorrent), routing (TCP/IP), scheduling (SETI@home), 
video streaming (YouTube), etc. it is quite reasonable to believe that detailed statistics about 
distributions of user demand are available to the designer. It would be impractical to ignore this 
information in designing mechanisms and from a lower-bounding point of view unrealistic not to 
assume the distribution is known. The BIC equilibrium notion is the relevant one for these settings. 
Note that it is not the case that agents in a BIC mechanism must themselves know the common 
prior distribution, as long as they trust that the mechanism designer does. Thus, we posit that 
that BIC and Bayesian approximation are the appropriate notions for a theory of mechanism design 
that is relevant to Internet systems. 

Our results apply generally to single-parameter agent settings where the designer's objective is 
to maximize the social welfare (e.g., single-minded combinatorial auctions). In the most general 

■^This is the basis for the economics community's dissatisfaction with worst-case approximation. 
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form, such an algorithmic problem can be written as finding an allocation x = (xi, . . . ,Xn) to 
maximize ViXi — c(x) for agent valuations v = {vi, . . . , Vn) and cost function c(-). For instance, 
the multicast auction problem of Feigenbaum et al. [10] is the special case where the c(x) is the 
sum of the costs of tree edges necessary to connect all agents served by x to the root (generally, the 
Steiner tree problem). A special and relevant case occurs when costs are zero for x in some feasible 
set system X and all other allocations are infeasible (i.e, c(x) = if x G ^ and oo otherwise). For 
the single-item auction, X is the collection of all sets of cardinality at most one; and for single- 
minded combinatorial auctions, X contains all sets of agents with non-intersecting desired bundles. 
Our most general result does not need any restrictions on the cost function or the set system. 
In particular, costs can be arbitrarily non-monotone or the set system non-downward-closed (e.g. 
public good or scheduling problems). 

For the most studied single-parameter mechanism design problems, the performance of the 
best approximation mechanism matches the best approximation algorithm (e.g., single-minded 
combinatorial auctions [11] and related machine scheduling [9]). None-the-less, our approach gives 
the best known approximation mechanism for many problems, such as auctions under various graph 
constraints [Ij and auctions of convex bundles |4J- 

Our result that there is no gap between algorithmic approximation and approximation by BIG 
mechanisms essentially solves the fundamental problem single-parameter algorithmic mechanism 
design. The remaining (theoretical) question of gaps in approximation factors imposed by incentive 
constraints is thus focused on whether BIG is more powerful than IG for social welfare maximization. 
For other non-welfare-maximization objectives (e.g. makespan) the question of a general reduction 
remains open. 

Related Work Lehmann et al. [ll] introduced the problem of computing the single-minded com- 
binatorial auction and give a mechanism that matches the best algorithmic approximation factor. 
Archer et al. [2j considered the setting where there are many (at least logarithmic) copies of each 
item and gave a (1 -|- e)-approximation mechanism. Archer and Tardos [3] gave (single-parameter) 
related machine scheduling mechanism that approximates the makespan. Dhangwatnotai et al. ^ 
gave an approximation mechanism for related machine scheduling that approximates makespan and 
matches the algorithmic lower bound. All of the above results are for ex post incentive compatible 
mechanisms. 

There has been a large literature on multi-parameter combinatorial auctions and approximation, 
but this is only tangentially related to our work so we do not cite it exhaustively. 

Babaioff et al. [5] looks at the equilibrium notion of algorithmic implementation in undominated 
strategies and gives a general technique for turning an /3-algorithm into a /?(logt;maa;)-approximation 
mechanism. This solution concept requires that no agent plays a strategy that is dominated by an 
easy to find strategy. Their approach applies to single-valued combinatorial auctions and does not 
require the mechanism to know which bundles each agent desires. 

Christodoulou et al. [8] looks at Bayes-Nash equilibrium in simultaneous Vickrey auctions in 
a combinatorial setting and shows that these give a 2-approximation when agents' valuations are 
submodular. 

There are many papers for profit maximization that consider Bayesian design settings. These pa- 
pers do not tend to consider computational constraints and for many of these (non-computational) 
settings the restriction to ex post incentive compatibility is without loss. One example where ex 
post incentive compatibility is with loss is when the profit maximizing seller has a strict no-deficit 
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constraint [7]. 

To our knowledge no prior work looks at computational questions related to approximation of 
social welfare by a Bayesian incentive compatible mechanism. 

2 Model and Definitions 

Algorithms. We consider algorithms for binary single-parameter agent settings. An algorithm 
in such a setting must select a set of agents to serve. This dllocation is denoted by x — (^i^ • • • ? ^n) 
where Xi is an indicator for whether or not agent i is served. Agent i has valuation Vi for being 
served. The vector v = (f i, . . . , v^) of valuations is the valuation profile. The seller may have some 
cost function c(-) over allocations that represents the cost of serving the allocated set. A special 
case are zero-infinity cost functions where the set of allocations with zero cost are feasible and those 
with infinite cost are infeasible. 

The objective we consider is social welfare maximization, i.e., to find the allocation x to maxi- 
mize UjXj — c(x). We refer to this as the general costs setting and it includes, for example, Steiner 
tree problems. We refer to the special case where the cost function is zero or infinite as the gen- 
eral feasibility setting. General feasibility problems include scheduling and public good problems. 
An important subclass are downward-closed (feasibility) settings where any subset of a feasible set 
is feasible. Downward closed settings include single-minded combinatorial auctions and knapsack 
problems. For any setting, OPT(v) will denote the maximum social welfare. 

An algorithm A is simply an allocation rule that maps valuation profiles to allocations. The 
allocation rule for A we will denote by x(v). The social welfare of an algorithm is ^(v) = 
^jfiXj(v) — c(x(v)). We allow A to be randomized in which case Xi{v) is a random variable; 
^(v) denotes the expected welfare of the algorithm for valuation profile v. 

We will be considering these algorithmic problems in a Bayesian (a.k.a. stochastic) setting where 
the valuations of the agents are drawn from a product distribution F = Fi x • • • x Fn. Agent i's 
cumulative distribution and density functions are denoted Fi and fi, respectively. The distributions 
are assumed to be common knowledge to the agents and designer. Some of our bounds will be 
based on Vmax, an upper bound on any agent's valuation, and //max = inaxjE[fj], the maximum 
expected valuation of any agent. 

The pair (c(-),F) defines a setting for single-parameter algorithm design which we will take 
as implicit. For this setting, the optimal expected welfare is OPT = Evr^F[OPT(v)] and the 
algorithm's expected welfare is „4 = Ev~f[-^(v)]. An algorithm is a worst-case (3 -approximation if 
for all V, ^(v) > 0PT(v)//5. An algorithm is a Bayesian f3- approximation if ^ > OPT /(3. 

Mechanisms. A mechanism M consists of an allocation rule and a payment rule. We denote 
by x(v) and p(v) the allocation and payment rule of an implicit mechanism Ai. We assume the 
agents are risk neutral and individually desire to maximize their expected utilities. Agent i's utility 
for allocation x and payments p is ViXi —pi. We will consider single-round, sealed-bid mechanisms 
where agents simultaneously bid and the mechanism then computes the allocation and payments. 

Our goal is a mechanism that has good social welfare in equilibrium. The standard economic 
notion of equilibrium for games of incomplete information is Bayes-Nash equilibrium (BNE). The 
revelation principle says that any equilibrium that is implementable in BNE is implementable with 
truthtelling as the BNE strategies of the agents H Meaning: an agent that believes that the other 

^The revelation principle holds even in computational settings; any BNE for which the agent strategies and the 
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agents are reporting their values truthfully as given by the distribution has a best response of also 
reporting truthfully. A mechanism with truthtelling as a BNE is Bayesian incentive compatible 
(BIC)EI 

It will be useful to consider agent i's expected payment and probability of allocation conditioned 
on their value. Denote Pi{vi) = Ev,A^|j'i(v) | Vi\ and Xi{vi) = Ev,A^[xj(v) | Vi\. The following 
theorem characterizes BIC mechanisms. 

Theorem 2.1 A mechanism is BIC if and only if for all agents i: 

• Xi{vi) is monotone non- decreasing, and 

• Piivi) = ViXi{vi) - f^' Xi{z)dz + pi{0). 
Usually, Pi{0) is assumed to be zero. 

This motivates the following definition: 

Definition 2.1 An allocation rule x(-) is monotone for distribution F if Xi{vi) is monotone non- 
decreasing for all i. An algorithm is monotone if its allocation rule is monotone. 

From Theorem l2.lt BIC and monotone are equivalent and we will use them interchangeably for both 
algorithms and mechanisms, though we will prefer "BIC" when the focus is incentive properties 
and "monotone" when the focus is algorithmic properties. 

Computation. Our main task in demonstrating that the approximation complexity of algorithms 
and BIC mechanisms is the same by giving an approximation-preserving reduction from the BIC 
mechanism design problem to the algorithm design problem. In other words, we use the algorithm's 
allocation rule to compute the mechanism's allocation and payment rules. As we are in a Bayesian 
setting this computation will also need access to the distribution. We consider two models of 
computation: an ideal model and an oracle model. 

In the ideal model, we will assume we have explicit access to the functional form of the distri- 
bution and allocation rule and we will assume we can perform calculus on these functions. While 
this model is not realistic, we present it for the sake of clearly explaining the economic theory that 
drives our results. In the oracle (a.k.a. black-box) model we will assume we can query the algorithm 
on any input and that we can sample from the distribution on any subinterval of its support. Our 
philosophy is that the ideal model is predictive of what is implementable in polynomial time and we 
verify this philosophy by instantiating approximately the same reduction under the oracle model. 

We give our algorithm runtimes in the oracle model in terms of the number of calls to the 
algorithm oracle. This is justified as each such call requires at most a linear number of calls to the 
distribution oracle and no significant additional computation is needed. 

mechanism can be computed in polynomial time can be converted into a polynomial time BIC mechanism. 

^Much of the computer science literature on mechanism design focuses on dominant strategy equilibrium (DSE) 
and ex post incentive compatibility (IC). This is not without loss in many settings and therefore should be considered 
with care when addressing computational questions in mechanism design. Whether it is without loss in computational 
settings is an open question. 
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3 Reduction: the Ideal Model 



Theorem 3.1 In the ideal model and general cost settings, a BIC Bayesian (3 -approximation A 
can be computed from any Bayesian P- approximation algorithm, A. 

Suppose that we are given an algorithm A that is monotone for the distribution F. Then A is 
aheady BIC and specifies the ahocation rule, so we must only compute the payment rule. In our 
ideal model this is trivial given the formula from Theorem 12.11 

Now suppose we have a non-monotone Bayesian c-approximation algorithm A with allocation 
rule x(-). We would like to use A to construct a monotone algorithm A from which we can obtain 
a BIC mechanism by simply computing the payment rule as above. We must make sure that in 
doing so we do not reduce the algorithm's expected welfare. The key property of our approach 
which makes it tractable is that we monotonize each agent's allocation rule independently without 
changing (in a Bayesian sense) the allocation rule any other agent faces. This property is also 
important for the approximation factor as A is guaranteed to be a Bayesian c-approximation only 
for the given distribution F, and may not be a good approximation for some other distribution. 

In summary, the desiderata for monotonizing agent i are: 

Dl. monotone Xi{vi), 

D2. (weakly) improved welfare Eiv-[viXi{vi)] > E^. [fjXj(uj)], and 
D3. other agents unaffected. 
Notice that if we satisfy the last condition we can apply the process simultaneously to all agents. 

3.1 Ironing via Resampling 

There is a history of fixing non-monotonicities in Bayesian mechanism design. Notably, Myerson 
invented a technique of ironing which relies on the fact that if an allocation rule is constant over 
some interval then any agent within that interval is effectively equivalent to a canonical "average" 
agent from that interval. Myerson applied this theory to iron virtual valuation functions which are 
a crucial construct in Bayesian profit maximization [12]. We will apply this theory directly to the 
allocation rule. 

Before we describe our ironing procedure in full, let us develop some intuition. Suppose alloca- 
tion rule Xi(-) of .A is non-monotone for agent i. A simple approach to flattening non-monotonicities 
is to choose some interval [a, b] on which Xj(-) is non-monotone, and treat agent i identically when- 
ever Vi G [a,b]. For example, whenever Vi S [a,b] we could choose to pretend that Vi is actually 
some other fixed value v' (e.g. v' = a), and pass this "pretend" value v' to the algorithm. Unfor- 
tunately, if we take this naive approach, we would have changed the distribution of agent i's input 
to the algorithm (in particular, the probability of value v' would be increased) and violated D3. In 
order to maintain D3 we make a minor modification: instead of picking a fixed v' , we will draw 
v' from Fi restricted to the interval [a,b]. Thus, we are replacing Vi G [a, 6] with v' drawn from 
the same distribution. Other agents cannot tell the difference - this operation does not change 
the distribution of agent i's input! Moreover, agent i will indeed be treated identically whenever 
Vi G [a,b]: the new probability of allocation will be precisely the distribution weighted average of 
Xj(-) over interval [a,b]. 
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(a) 



(b) 



Figure 1: (a) A non-monotone ironing (solid) of curve Xi (dashed), (b) The corresponding 
integral curves G (dashed) and G' (solid) in probability space. 

Let Xi{vi) represent the allocation rule we obtain from the following procedure (when v_j ~ 
F-i): 

1. if G [a, h], redraw v' ~ Fj[a, 6], 

2. else, set v' = Vi. 

3. run A{v' , v_j). 

We say that x/(-) is the curve Xj(-) ironed on interval [a, 6]. We note that Xi{vi) = Xi{vi) for 
Vi [a, 6] and Xi{vi) = 'Eiyi^p^[xi{v') \ v' E otherwise. This is illustrated in Figure [Dj^a) . 

Note that we can easily iron along multiple disjoint intervals, redrawing v' from whichever interval 
contains Vi (if any). 

Figure HJa) demonstrates that an ironed allocation rule is not necessarily monotone. We now 
explore a method for choosing intervals on which to iron in order to obtain monotonicity. It will be 
instructive to consider the allocation rule in probability space instead of valuation space and the 
cumulative allocation rule (also in probability space). 

1. Let g{q) = Xi{Fi~'^ (q)) be the allocation rule in probability space. 

2. Let G{q) = g{z)dz be the cumulative allocation rule. 

We make two important observations: Xi is monotone if and only if g is monotone, and g is monotone 
if and only if G is convex. Let be Xi ironed on interval [a,b], and consider the corresponding 
curves g' and G' . Then G' and G are equal, except on [a,b] where G' is a straight line connecting 
(a, G{a)) to (6, G{b)). See Figure mb). We can therefore view our interval selection problem as the 
problem of replacing portions of curve G with straight line segments, so that the resulting curve 
G' will be convex. This is precisely the problem of finding the convex hull of G\ Thus the choice 
of intervals that monotonizes Xi is precisely the set of intervals defined by the convex hull of G. 

3. Let G(-) be the convex hull of G{-). 

4. Let li = {Ii, I2, . . .} be the intervals in valuation space where G{Fi{vi)) > G{Fi{vi)) 
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We refer to Xj as the set of monotonizing intervals for Xj. Once we have found Xj, we simply 
apply our ironing procedure to the intervals in Xj. That is, whenever Vi S / G Xj, we redraw v from 
the distribution restricted to / and input v into A instead of Vi. The resulting allocation rule will 
be Xi{vi) = g{Fi{vi)), which is monotone since G is convex. See Figure [2l 

3.2 The Ironed Algorithm 

We are now ready to define our ironed algorithm A. 

Definition 3.1 (A) Given a profile of disjoint interval sets X = {Xi, . . . ,X„}, the ironed algorithm 
Ax for A with intervals X is: 

1. For each agent i, if Vi £ I £ li, draw Vi ~ Fi[I]; else, set Vi = Vi- 

2. Run A{v). 

The ironed algorithm A for A computes the monotonizing intervals li for each agent, as described 
in the previous section, then runs Aj{v). 

Lemma 3.2 A is monotone. 

Proof: It is enough to show that each agent has a monotone allocation rule. By construction the 
allocation rule satisfies Xi{vi) = g{Fi{vi)), which is the derivative of a convex function and therefore 
monotone. □ 

Lemma 3.3 If A is a Bayesian (3 -approximation then A is a Bayesian (3 -approximation. 

Proof: First notice that the two allocation rules produce the same distribution over allocations 
and therefore expected costs are identical. We will show, for a single agent i, that E[uiXi(fj)] > 
^\viXi{vi)] (i.e., D2), from which linearity of expectation implies the result. In fact, the former 
stochastically dominates the latter: for any Uj, Xi{z)fi{z)dz > Xi{z)fi{z)dz, since G{q) < 
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G{q) for all q and thus there is more weight in the expectation on higher values for Xj(-) than for 

X^{■). □ 

Theorem 13.11 follows immediately from Lemmas 13.21 and 13.31 An immediate corollary for worst- 
case approximation algorithms follows since worst-case approximations are a subclass of Bayesian 
approximations . 

Corollary 3.4 In the ideal model and general cost settings, a BIC Bayesian [5 -approximation A 
can he computed from any worst-case (3 -approximation algorithm, A. 



Notes. We make the following notes about our main result (detailed discussion is given in our 
conclusions) . 

• The argument fails for non-linear objectives such as makespan. While D2 holds, it will not 
lead to an overall bound on the expected performance. See Appendix lA.il for an example. 

• Our ironing procedure is distinct from Myerson's. Myerson irons virtual valuations and 
allocates to maximize ironed virtual value. This is not the same as maximizing virtual value 
and then ironing the allocation rule where it is non-monotone. See Appendix IA.3I for an 
example. 

• If ^ is a worst-case c-approximation, A may fail to be a worst-case c-approximation. See 
Appendix IA.2I for an example. 



4 Reduction: the Oracle Model 

We will use the ironing procedure from the previous section in order to monotonize an algorithm in 
the oracle model. In the ideal model, we used knowledge of an algorithm's allocation rule in order 
to find the monotonizing intervals for each agent. In the oracle model our approach will be the 
same, except that instead of using direct knowledge of the allocation rule we must use sampling 
to estimate it. This sampling introduces errors which must then be dealt with. Our analysis will 
proceed in the following steps. 

1. We describe a method for computing payments in the oracle model. 

2. We describe a method for combining sampling with ironing to obtain a nearly monotone 
algorithm. In fact, this algorithm will be efmax-Bayesian incentive compatible!! 

3. We show that a convex combination of this nearly monotone algorithm with a blatantly 
monotone one will give a monotone algorithm. Coupled with the payment rule, this gives a 
Bayesian incentive compatible mechanism. 

All of these steps approximately preserve social welfare. We obtain the following main theorem 
and corollary. (Recall: Vmax is an upper bound on any valuation and ^max is the maximum over 
the agents' expected valuations.) 



^This intermediate result may be desirable in itself. If Vmax is polynomially bounded, we can get an e-BIC 
mechanism in polynomial time. 
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Theorem 4.1 In the oracle model and general cost settings, for any e > 0, a BIC algorithm Ae 
can be computed from any BdycsiciTi olgoTithTfi J\.. Its social wdfciTc satisfiGS ^ *A — ^/^max* 

The 

runtime is 0{n^e~^log^{vmax/^fJ'msix))- In general feasibility settings, the runtime can be improved 

to 0(n^e~^log^(fmax/e/"max))- 

For the special case of downward-closed set systems for feasibility problems, we can assume that 
A > /^max) since the trivial algorithm that simply allocates to the single player with the highest 
input value attains this value. This implies the following corollary. 

Corollary 4.1 In the oracle model and downward-closed feasibility settings, for any e > 0, a BIC 
Bayesian [5 {\ + e)- approximation algorithm Ae can be computed from any Bayesian (3 -approximation 
algorithm A. Its runtime is 0{n^e~^\og^{vmax/^I^iaa.x))- 

4.1 Computing Payments 

Suppose that A has monotone allocation rules. The problem of designing a mechanism to implement 
A then reduces to calculating appropriate payments. These payments are completely determined by 
the allocation rule of A, but in the oracle model we do not have direct access to the functional form 
of the allocation rule. Archer et al. [2] solve this problem by computing an unbiased estimator of 
the desired payment rule using only oracle calls to the algorithm. For completeness we summarize 
their approach below. 

Definition 4.1 (oracle payment procedure) // A does not allocate to agent i, then agent i 
pays 0. Otherwise, we compute the payment of agent i as follows: 

1. Choose Vi uniformly from [0,Vj] 

2. Draw v'_- ~ F_j and run ^(w/, v'_J 

3. If A allocated to agent i in the previous step set X = vi, otherwise set X = 0. 

4- If X ^ 0, repeatedly draw - ~ F_j and run A{vi, until the algorithm allocates to player 
i, and let T be the number of iterations required. 

5. Agent i 's payment is Pi = Vi — TX . 

As was shown by Archer et al., this computation attains the appropriate expected payment. 

Claim 4.2 (Archer et al. [2]) The oracle payment procedure produces payments pi{vi) = ViXi{vi) — 
lo' Xi{z)dz. 

We note that since we execute this procedure for agent i only if he receives an allocation, which 
occurs with probability Xi{vi), the expected number of calls to A for each player is at most 

Thus, in expectation, all payments can be computed with 2n calls to A. 

Notice that any mechanism paired with this payment scheme will be individually rational (IR), 
meaning that a truthtelling agent will never obtain negative utility. This is true even if the allocation 
rule is not monotone. This follows immediately from the fact that the payment for an agent that 
declares value Vi is never greater than Vi (indeed, it is defined as Vi minus a non- negative value). 
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4.2 Sampling and e-Bayesian Incentive Compatibility 

We will be estimating the allocation rule of our non-monotone algorithm and attempting to iron it. 
This will fail to result in an absolutely monotone rule. In this section we show that a nearly mono- 
tone rule results in truthtelling as an e-Bayes-Nash equilibrium (e-BNE): the most an agent can gain 
from a non-truthtelling strategy is an additive e. We call such a mechanism e-Bayesian incentive 
compatible (e-BIC). While there is no characterization of payment rules for e-BIC mechanisms, the 
payment rule given by Theorem 12.11 will be sufficient. 

Definition 4.2 (e-BIC) A mechanism is e-Bayesian incentive compatible if truthtelling obtains at 
least as much utility as any other strategy, up to an additive e, assuming all other agents truthtell. 
That is, for all i, vi, and v' , 

ViXi{vi) - pi{vi) > Vix{v') - Pi{v') - e. 

The main theorem of this section is the following. 

Theorem 4.2 In the oracle model and general cost settings, for any e > 0, an eVmax-BIC algorithm 
A! can be computed from any Bayesian algorithm A. Furthermore, A' > A — e/Umax- The running 
time of A' is at most d{n^e~'^logi^^{vmax/eHma.x))- 

As in Corollary 14.11 for the special case of downward-closed set systems, we can assume that 
A > /Xmax (since any algorithm could simply choose to allocate only to the one player with highest 
expected value). This implies the following corollary. 

Corollary 4.3 In the oracle model, for any e > 0, an eVmax-BIC Bayesian (3 {\ + e)- approximation 
algorithm A! can be computed from any Bayesian (3 -approximation algorithm A for a downward- 
closed set system. The running time of A! is at most 0{n^e^'^\ogi_^^{vmax/(-IJ'Uiax)- 

4.2.1 e-closeness 

We now formalize a closeness property under which an allocation rule that is close to monotone is 
e-BIC (for some related e). 

Definition 4.3 (e-close) Allocation rules x(-) and x'{-) are e-close if \x{v) — x'{v)\ < e for all v. 
Two algorithms or mechanisms are e-close if each agent's allocation rules are e-close. 

Lemma 4.4 // )-BIC. 

Proof: Suppose agent i is participating in A' and has value Vi, but claims to have value f/. 
Assume Vi > v/; the opposite case is similar. Using the payment rule from Theorem 12. H agent i's 
gain in utility from declaring Vi is: 

{ViXi{Vi') -pi{Vi)) - {ViXi{Vi) -pi{Vi)) = {Vi - Vi)Xi{Vi) - / Xi{z)dz. (1) 

Jv,' 

Since x/(-) is e-close to a monotone curve, it must be that Xi{z) + e > Xi{vi) — e for all z S [vi , Vi\. 
Thus J!^) Xi{z)dz > {vi — Vi){xi'{vi) — 2e). This implies that the value in ([T|) is at most 2e(fj — w/), 
which is at most 2evmax- ^ 
Now we show that if two algorithms have e-close allocation rules, then they obtain similar 
welfare. In the statement of Lemma 14.51 we assume that the expected costs of both rules are the 
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same. This is implied by our tcclmiqucs: since our mechanism runs the original algorithm on values 
from the original distribution, the expected cost of our mechanism is equal to the expected cost of 
the original algorithm. 

Lemma 4.5 If A and A' have the sauie expected costs and ave e-close then A! A — ^^/^max* 

Proof: For each agent i, E[wjXj'(wj)] > E[ui(a;i(fi) — e)] > '^\piXi{vij\ — ^\pi\. The result then 
follows by linearity of expectation. □ 
Finally, we note that the property of e-closeness persists when algorithms are ironed on some 
collection of intervals. 

Lemma 4.6 // algorithms A and A' are e-close, then for any collection I = {Ii , . . . , X„} of interval 
sets, ironed algorithms Ai and Aj are e-close. 

Proof: For any i, let Xi and be the allocation rules of A and A', respectively. Let Xi and 
be the allocation rules of Aj and Aj. Then for any I G Ii, 

\xi{I) - = \E^[x{v) \vel]- Ey[x'{v) \vel]\ = \E^[x{v) - x'{v) \ v e I]\ < e. 

□ 



4.2.2 Discretization 

A key step in our reduction will be in discretizing the allocation rules of the algorithm. This is 
needed for two reasons: 

• We need to estimate the allocation rule and so we need to pick a polynomial number of points 
on it to estimate. (This is the focus of the next step.) 

• Our resulting allocation will be almost monotone. By discretizing we will end up with a 
polynomial number of points in which it is non-monotone and we will know exactly where 
these points are. (In a subsequent step, a convex combination of the sampled algorithm with 
a blatantly monotone one will fix the non-monotonicitics.) 

We say that an algorithm is k-piece piecewise constant if for each i there is a partition of 
valuation space into at most k intervals such that the allocation rule for agent i is constant on each 
interval. Our discretization procedure, which converts any algorithm into a piecewise constant 
algorithm, is the following. 

Definition 4.4 {Ae) Given algorithm A and e > 0, the discretization of A, Ae, is the ironed 
algorithm Ax, where X = {Ti, . . . is the collection of intervals defined by 

Ti = {[0, eMma.)} U { [e/X^ax(l + e)*, e/^ms«(l + e)*+') }o<t<logi+,(«^a./6Mma.) • 

Lemma 4.7 For any algorithm A and e > 0, Ae is logi_|.g(i;max/e/^max)-7'^ece piece-wise-constant 
for all i, and Ae> A — 2ne/Xinax- 

Proof: The allocation curves for Ae arc constant on interval [0, e/Ltmax) and all intervals of the 
form [e;Umax(l + e)*, e/"max(l + e)*^^), and there are at most logi_|_^(t;max/eAimax) such intervals 
over the range [e'/^maxj ^^max]- These intervals do, indeed, partition valuation space. Furthermore, 
'^vi[viXi{vi)] > (1 - e)Ey.[viXi{vi)] - e/imax > 'Eivi[viXi{vi)] - 2e/Xi„ax, as algorithm Ae modifies any 
input value greater than e/Xmax by at most a factor of (1 — e). As the expected costs before and 
after discretization are the same, the result follows from linearity of expectation. □ 
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4.2.3 Statistical Estimation 



We next describe a sampling procedure for estimating an allocation rule. This procedure will not 
form an algorithm, but rather generates an estimated allocation curve, which we will denote by 
y(-). This estimate behaves like an allocation rule, but is not associated with an actual algorithm. 
Nevertheless, we can pretend that it is an algorithm and apply Lemma 14.41 and Lemma 14.51 to it. 
The appropriate way to view this is "if y was an allocation rule for an algorithm, then ..." 

Definition 4.5 (estimated allocation rule) Given algorithm A which is k-piece piece-wise-constant 
and e > 0, an estimated allocation rule for A is a curve y(-) found as follows: 

1. for each agent i and valuation- space piece Ij, draw 4(e)^^ log {2k/ne) samples from F condi- 
tional on Vi £ Ij, and run A on each of these samples. 

2. let Uij be the average allocation over the invocations to A above, for each i and j. 

3. Define y by yi{v) = yij for all v G Ij 

Lemma 4.8 Suppose algorithm A is k-piece piece-wise-constant. Then, for any e > 0, an estimated 
allocation rule y(-) is k-piece piece-wise-constant, and is ^-close to x(-) with probability at least 1— §• 
The construction o/y(-) requires 0(nA;(e)~^ log (2/c/ne)) black-box calls to A. 

Proof: The execution time and the fact that y(-) is k-piece piecewise constant follow immediately 
from the definition. Choose some i and let Ij denote piece j of the valuation space for agent i in 
A, and write Xi{Ij) for the (constant) value of Xi{v) for any v G Ij. By the Hoeffding-Chernoff 
inequality, the probability that \yij-Xi{Ij)\ > e/2 is at most e-'^(^)"' i°g(2fc/ne)(e/2)2 < g/2A;n. Thus, 
taking the union bound over all i and j, we conclude that 



4.2.4 Statistical Ironing 

We are now ready to combine our sampling procedure with the ironing procedure from the ideal 
model to construct an e-BIC algorithm from a piecewise constant algorithm A. 




for all i and j with probability at least 1 — f • 



□ 



Definition 4.6 {Ae) Given piecewise constant algorithm A and e > 0, the statistically ironed 
algorithm Ae for A with error e is.' 



1 



Find the estimated allocation rule y(-) for A. 



2. 



Find monotonizing intervals I for y(-) (as described in Section \3. 



3. 



Run ^i(v). 



Lemma 4.9 Ae is 2eVmax-BIG and Ae> A — ne^_ 



-max ■ 
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Proof: By Lemma 14.81 yi{-) is fe-piece piece-wise-constant for each i. Let Ay be the (fictional) 
algorithm with allocation rule y. Since 2 is the monotonizing interval set for ^y, if Ay were ironed 
according to X, the result would be Ay which is monotone. 

By Lemma |4.8( Ay is |-close to A with probability 1 — f • In this case, Lemma 14.61 implies Aj 
is |-close to ^y. For the remaining probability, |, we note that Aj is trivially 1-close to Ay. Thus, 
taking expectation over all possible outcomes of the sampling, we conclude that Ae is e-close to 
monotone, and is therefore 2efm,aa;-BIC by Lemma |4.4[ 

Since Ax is | close to ^y with probability 1 — f , Lemma 14.51 and Lemma 14.71 imply that with 
probability 1 — f , 

1 1 

Ax ^ Ay ~n'^fJ'max ^ Ay "'T'fi/^max ^ A H-e^max' 

For the remaining probability, |, we note that trivially Ax > ^ — ^ > ^ — ?^/Umax• Thus, taking 
expectation over all possible outcomes of sampling, we conclude Ae > A — nefimax- ^ 
Theorem 14. 21 now follows by taking A' to be the statistically ironed algorithm A^' applied to the 
discretization A^' of A, where e' = e/3n. Then by Lemmas l4.9l and l4.7^ A' is 2e'fmax-BIC, and hence 
ei^max-BIC, and A' > A^' — ne'fimax > A — 3ne' Umax = A — efimax- The runtime (which is dominated 
by sampling in the construction of y) is 0{nke'~^ log {2kn/e'))) = 0{n^e^^ log(t'maa;/e/^max)), where 
recall k = logi_|^^(t'max/eAimax) is the number of discrete intervals in A^'- 

4.3 Bayesian Incentive-Compatibility 

We now outline machinery by which we can make an e-BIC mechanism BIC, outright, without 
much additional loss. This approach will apply to any e-BIC mechanism with allocation rules that 
are piecewise constant. 

4.3.1 An Explicitly Monotone Algorithm 

Our approach will be to form a convex combination of an algorithm A that is close to monotone 
with the following explicitly monotone algorithm. 

Definition 4.7 Suppose A has k-piece piece-wise constant allocation rules, and suppose Si, . . . ,Sn 
are allocations such that i ^ Si for all i. The stair algorithm for A, S"^, does the following: 

1. Pick an agent i uniformly from the n agents. 

2. If Vi is in the jth highest piece of k pieces, allocate to Si with probability (j — l)/{k — 1). 

We observe that if A is e-close to a monotone algorithm, then a convex combination of A and S"^ 
will be monotone. 

Lemma 4.10 If A' with k-piece piece-wise-constant allocation rules is e-close to a monotone A, 
then A, the convex combination of A' with probability 1 — 6 with 5"^ with probability 6, is BIC, 
where 6 = 2{k — l)ne. 

Proof: To show A is BIC, choose any agent i and any values Vi < f/; we will show Xi{vi) < Xi{vi). 
If Vi,Vi are in the same piece of the valuation space then Xi{vi) = Xi{vi ). Otherwise, since A! is 
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e-close to monotone A, it must be that Xi{vi) < Xi{vi) — 2e. Furthermore, if s(-) is the aUocation 
rule for 5"^ , then Si{vi) < Si{vi ) + l/{k — l)n. We conclude that 

Xi{vi) = (1 - 5)xi{vi) + 5si{vi) 

< x,{vi')-2e + 5/{k-l)n 

= Xi{Vi) 

as required, since 5 = 2{k — l)ne' . □ 
We like to apply Lemma 14.101 to Ae from Lemma |4.9( thereby proving Theorem 14.11 However, 
one issue has not been addressed: how to find sets 5i, . . . , 5„. In many settings finding such sets 
is trivial. For example, in downward-closed feasibility problems (e.g. combinatorial auctions), we 
could simply take Si = {i} and Theorem 14.11 follows immediately. However, in general settings, it 
may be difficult to find feasible sets (or low-cost sets in general cost settings). The remainder of 
this section will be devoted to describing a general process by which sets 5i, . . . , 5^ can be found. 

4.3.2 Implementing the Stair Algorithm in General Cost Settings 

In general cost settings, algorithm S"^ may incur negative value if, for some i, the cost of set Si is 
large relative to Vi. To bound the value of A we must therefore limit the costs of sets 5i, . . . , 5„. 
Our upper bound on cost will depend on the following quantity, which relates to the structure of 
the piecewise constant intervals for algorithm A. 

Definition 4.8 Suppose A has piece-wise constant allocation rules, where li = {/i, I2, ■ ■ ■} are the 
constant intervals for agent i. The stair threshold for agent i, wf', is defined as wf- := max/i. 
That is, wf' is the upper endpoint of the first valuation space interval for agent i. 

We can now relate the value of A to the cost of sets S*! , . . . , 5„ and the stair thresholds of algorithm 
A. 

Lemma 4.11 If there exists X > such that c{Si) < wf + X for all i, then A > A — d{nfi^s,x + X). 

Proof: By construction, A = {1 — 6)A+6S'^ . Recall that S'^ chooses some i uniformly at random, 
and then either allocates Si or 0. Moreover, S"^ will always allocate if Vi is in the first piece of the 
valuation space; that is, if Vi < wf. We therefore conclude that S'^ > ^ ^jminjii;-^ — c{Si),0} > 

—X. Also, A < n^yaa.x trivially. Thus ^ > ^ — Snfi^aax + ^{—X)) = A — 5(n//niax + X). □ 
Our goal will be to find sets Si with c{Si) < wf + npu^^^j then apply Lemma 14.111 with 
X = n^max/\/e- To find such sets, we will apply the same sampling techniques used in the 
construction of Ae- That is, for each i and each piece of the valuation space, we will run A on 
many sample inputs. As long as Xi{I) is not too small on a given interval I, we are very likely to find 
some valid allocation that includes agent i during the sampling process! Moreover, we will show 
that not all sets discovered in this way can have high cost, so with high probability we will find a 
set Si with cost at most n|Umax/-v/e- To relate the cost of set Si with wf, we will also iron together 
all left-most intervals for which a low-cost set was not found. These ironed-together intervals will 
then act like a single piece of valuation space, which will allow us to relate the cost of any set Si 
we do find to the stair threshold for the (modified) algorithm. 

Definition 4.9 {A^) Given piece-wise constant algorithm A, the stair-compatible algorithm for 
A, Al, is as follows: 
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1. For each agent i: 

2. Let Xi = {/i, . . . , /fc} he the constant valuation space intervals for agent i. 

3. For each Ij E Xj, draw e^^ log(n/2e) samples from F conditional on Vi £ Ij, and run A on 
each of these samples. 

4- Let ji be the minimal index such that, for some sample of interval Ij., A allocated a set Ti 
with Ti 3 i and c{Ti) < min/j. + n/imax/\/e- Choose Si to be any such Ti. 

5. If no such set T was returned for any interval, take ji = k + 1 and Si = {i}. 

6. Define = {/i U . . . U , 7^, , . . . , 4} . 

7. Run ^j'(v). 

Note that, as part of the execution of A^, a set Si 3 i will be found for each i, which is taken to he 
any set satisfying the conditions on line 4 for interval Ij. (or {i} if no sets were found). 

In summary, A'^ samples each constant interval for agent i, searching for an appropriate set Si. 
We take Ij. to be the leftmost interval for which such a set Si was found. All intervals to the left 
of Ij. are then ironed together. Thus, regardless of the sampling outcome, Ij. will be the second 
valuation space piece for agent i in algorithm Ail Thus c{Si) < ^ + n/imax/\/e- 

Lemma 4.12 The stair compatible algorithm A^ for A (Definition \4.B^ and stair thresholds w-^^ 
(Definition \4.8\ ) satisfy c{Si) < w. ' + n/Umax/\/e for all i. 

Proof: For each i, if no set satisfying the conditions on line 4 of A^ was found during the sampling 
of any interval, then ji = k + 1 and all intervals of A are ironed together in I'. In this case vu- " = oo, 
so c{Si) < w- ' trivially. Otherwise, by line 4 of ^f, c{Si) < mm Ij. + n[i^^^l However, since 
all intervals to the left of Ij. are ironed together in X' , Ij. will be the second piecewise constant 
interval of A\, and hence min/j. = w- " . Thus c(Si) < minw- " + n/imax/\/e as required. □ 

Lemma 4.13 The stair compatible algorithm A^ for A (Definition \4.9) satisfies Al > „4— 2n^max/\/e- 

Proof: We claim that, with probability at least 1 — |, for each agent i, the allocation rules for Aji 
(from line 7 A^) and A will differ only on values Vi for which Xi{vi) < \/e + |. Before proving the 
claim, let us see how it implies the desired result. The claim implies that Ai' > A — (\/e+ f )?T-Aimax 
with probability 1 — f • For the remaining probability, we note that Aj' > ^ — „4 > ^ — n/imax 
trivially. Thus, over all possible outcomes of sampling, we conclude that 

Al>A - (^^/e + 1^ n/imax - |n/imax = A - \/en//max 

as required. 

Let us now prove the claim. Choose some agent i and suppose that Aj' and A differ on some 
interval / with > ^/€ + |. Let / be the leftmost such interval. For the remainder of the proof 
we will say that a set T has low cost for I if c{T) < min / + n^max/ \/e- Then, by the definition 
of it must be that no set T 3 i with low cost was found during the sampling of interval I 
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for agent i. Let us bound the probability of this event. Given v ~ F, let B(v) be the event 
[xi(v) A f i < mini + n/iinax/\/e]- If event -B(v) occurs for some sample v, this means that A 
returned some allocation T 3 i and furthermore X^i — + nUmax/V^- But note that this 

allocation must generate non-negative profit (otherwise it would never be allocated), and hence T 
must have low cost for /. Thus B{v) is precisely the event that A returns a set T 3 i with low cost 
for /. 

Consider the probability of B(y). By Markov's inequality, Pfv Ylj^i'^j > '^^max/'v/e < ^/^■ 
Thus, since Vi > mini with probability 1 conditional on Vi G I, Prv[^j Vi > mini + n^max/\/e] < 
^/e. Also, Prv[-'2;j(v) | f « S /] = 1 — Xi{I) < 1 — {^/e + f )• The union bound then implies that 
Prvh^(v)] < 1 - (^/i + f ) + = 1 - f , so Prv[i?(v)] > f . 

By Chernoff-Hoeffding inequality, the probability that event B does not occur even once during 
e~^log(2n/e) samples is at most We conclude that the probability that no set T 3 i with low 
cost was found during the sampling of interval / is at most This is therefore a bound on the 
probability that Aj/ and A differ for agent i on some interval / with Xj (/) > ^/€ + | . By the union 
bound, the probability that this occurs for any agent is at most |, as required. □ 

We are now ready to describe algorithm Ae from the statement of Theorem 14.11 

Definition 4.10 {Ae) Given algorithm A and e > 0, the monotonization of A, An, 'proceeds as 
follows: 

1. Construct Ae, the discretized version of A. 

2. Construct Al, the stair- compatible version of Ae- This generates sets Si, . . . , Sn- 

3. Construct Ae, the statistically ironed algorithm for A^. 

4. With probability 6 = 2k{n — l)e, execute with sets Si, . . . , Sn- Else, execute Ae- 
Lemma 4.14 Ae is BIC, and Ae'> A — T/cn^y^/Xmax- 

Proof: Lemma [4.9l implies that Ae is e-close to a monotone algorithm, and during the construction 
of Al we find sets Si, . . . ,Sn with Si 3 i- Thus the convex combination of Ae with S'^" is BIC by 
Lemma 14.101 

We note that costs are not affected by our ironing techniques, and, by Lemma 14.131 

Al>Ae- 2Ven^max > ^ - S^/enfI^a.x- 
Also, c{Si) < ' + n/imax/\/e for all i by Lemma [4. 121 Thus, by Lemma [4.111 

Ae>Al- 5{nflraax + n/imax/\/e) 

> ^ - 3n^/imax - 2(2(A; - l)ne)n/imax/\/e 

>A - 7kn'^^/e^Iraa.^. 

□ 

Theorem 14.11 now follows immediately from Lemma 14.141 by considering algorithm Ae' , where 
e' = (e/7A;n^)^ = e^/49A;^n^. The runtime, which is dominated by the sampling in the construction 
of Ae, is 0(/cn(e')"^) = d{n^e-^ log^{vmax/eHms,^)). 
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4.3.3 Feasibility Settings 



In general feasibility settings, where costs are either or infinite, the performance of algorithm Ae 
improves significantly. Specifically, we can improve Lemma 14.131 as follows: 

Lemma 4.15 In feasibility settings, A'^ > A — 2en/imax- 

Proof: Consider some agent i and interval I £ 2, and suppose Xi (/) > | . Consider the probability 
of finding an allocation with cost at most min/ + e~^/^n//max when sampling for this interval. Since 
costs are either or oo, this is precisely the probability of finding an allocation that includes agent 
i, which is Xi{I) > |. By Chernoff-Hoeffding inequality, the probability that this event does not 
occur even once in e~^log(n/2e) samples is at most e/2n. We will therefore successfully find a set 
Si 3 i with probability at least 1 — e/2n. 

For each i, let Ij. denote the leftmost interval on which Xj(7) > e. By the union bound, with 
probability 1 — e/2 we will find a set Si 3 i when sampling interval Ij-, for all i. In this case, the 
behavior of algorithms A^ and A differ only on intervals / to the left of Ij-, all of which satisfy 
Xi{I) < |. Thus, conditioning on an event of probability 1 — |, > ^(1 — f ) > -4, — n/Umaxe- For 
the remaining probability, |, we note that A < n^^^.^ trivially. We conclude that Ac > ^— 2en/Xniax 
unconditionally. □ 

Using Lemma 14.151 instead of Lemma 14.131 in the analysis of Ae , we find that the statement 
of Lemma 14.141 improves to show that Ae > A — T/cn^e/Xmax in feasibility settings. The second 
half of Theorem 14.11 then follows by considering Ae' with e' = Ikn^e, which has a runtime of 
0{kn{e'y^) = 0(n5e-5log3(fmax/e/Umax))- 



5 Conclusions and Open Questions 

Our main result is for the setting of single-parameter agents and the objective of social welfare where 
we give a black-box reduction that converts any Bayesian approximation algorithm into a Bayesian 
incentive compatible mechanism. For these settings there is no gap separating the approximation 
complexity of algorithms and BIC mechanisms. While this result is extremely general, the situations 
not covered by our main theorem are of notable interest. 

1. Multi-parameter Bayesian mechanism design is not very well understood; however, there is 
every reason to believe that approximation (which has not been pursued much by the eco- 
nomics literature) has a very interesting and relevant role to play in providing positive results. 
A notable example is in approximations for Bayesian auctions for unit-demand agents (for 
the objective of profit maximization). Here the performance of an optimal single-parameter 
mechanism gives an upper bound on that of the optimal multi-parameter mechanism, and a 
mechanism for the multi-parameter setting is able to simulate the outcome of this optimal 
single-parameter mechanism without much loss in performance. See [6] for details. 

We would like to know whether there is a gap separating the approximation complexity of 
algorithms and BIC mechanisms in multi-parameter settings for social welfare maximization. 
This question is set in the context of a recent lower bound by Papadimitriou et al. [13j 
for the combinatorial public project problem that shows that there is a gap separating the 
approximation complexity of algorithms and ex post IC mechanisms. This result, however, 
does not immediately extend to show a separation for BIC mechanisms. An important first 



18 



step in addressing the Bayesian multi-parameter setting is captured by the fohowing question. 
Given a black-box for a weakly-monotone allocation rule^ can can BIC payments be computed 
in polynomial time? (That is, can one generahze Archer et ah's payment computation from 
the single-parameter setting [2J to the multi-parameter setting?) 

2. Our reduction applies to the objective of social welfare maximization. It would be nice to 
extend our result more generally to any monotone objective function, e.g. makespan. Unfor- 
tunately, our approach fails to preserve the approximation factor of the makespan objective. 
For welfare our ironing procedure (weakly) increases the objective because we become more 
likely to allocate to a higher-valued agent at the expense of being less likely to allocate to 
the same agent with a lower value. For makespan, allocating in the higher-valued case may 
not improve the objective at all if the makespan is given by another agent. Thus, our loss for 
allocating less often in the lower-valued case is not offset by any improvement. A concrete 
example of this is given in Appendix lA.li Is there a polynomial-time reduction that turns any 
approximation algorithm for any monotone objective into a BIC mechanism with the same 
approximation factor? 

3. In the special case where our reduction is applied to a worst-case c- approximation (recall: our 
reduction applies more generally to Bayesian c-approximations), the resulting BIC mechanism 
is still only an c- approximation in the weaker Bayesian sense. A concrete example of this 
is given in Appendix IA.2[ Is there a polynomial-time black-box reduction that turns any 
worst-case c- approximation algorithm into a BIC mechanism that is also a worst-case c- 
approximation ? 

4. While Bayes-Nash equilibrium (i.e., BIC) is the standard equilibrium concept for implementa- 
tion in economics, the stronger dominant strategy equilibrium (i.e., ex post IC) is the standard 
equilibrium concept for implementation in computer science. The main challenge in obtaining 
a similar reduction for IC mechanisms is that the valuation space is exponentially big and 
monotonizing all points seems to require an exhaustive procedure. The key to the approach 
of our BIC reduction is that resampling bids from the distribution can be done independently 
for each agent without changing an agent's prior on the other agent's bids. This does not 
seem to be the case in non-Bayesian settings. One potential approach would be to apply our 
ironing technique repeatedly, re-ironing an agent's curve whenever it is affected by an ironing 
of another agent's curve. Such a procedure can be argued to terminate, but will not generally 
give a monotone allocation rule. A concrete example is given in Appendix IA.4[ Is there a 
polynomial-time reduction for turning any f{n)- approximation (worst-case or Bayesian) al- 
gorithm for a single- parameter domain into an ex post IC mechanism that is a (worst-case 
or Bayesian) Q{f[n))- approximation? 

The final question above can be rephrased intuitively as our main open question. Is there a gap 
separating the approximation complexity of implementation by Bayesian incentive compatible and 
ex post incentive compatible mechanisms ? 

®Weak-monotonicity is a generalization of the monotone allocation rule requirement to multi-parameter settings. 
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A Counterexamples 
A.l Beyond Social Welfare 

In the ideal model, our general reduction applies to any single-parameter optimization problem and 
converts an algorithm into a mechanism with at least the same expected social welfare. Unfortu- 
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Figure 3: The allocation rule for algorithm A. The vertical axis corresponds to vi, the horizontal 
to V2, and the table entries are of the form "xi, X2" ■ For example, if {vi,V2) = (100,10), then 

(X1,X2) = (1,0). 



nately, this approach does not preserve other relevant objective values, even monotone ones such 
as the makespan. We illustrate this deficiency of the approach with an example. 

Consider the problem of jo6 scheduling on related machines with the goal of minimizing makespan. 
Here the machines are agents and each has a (privately-known) speed. The time a job takes on 
a machine is the product of its length and the machine's speed. The goal of the mechanism is to 
assign a given set of jobs with varying lengths to the machines so as to minimize the time until all 
machines have finished processing their jobs, a.k.a., the makespan. 

Consider an instance in which we have 10 unit-length jobs, and 5 machines. We assume a 
Bayesian setting, where the speeds of the machines are probabilistic. The first 4 machines are 
identical: they all have speed 2 with probability 1. The last machine has either speed 1 or speed 
2, each with probability 1/2. 

Suppose A behaves in the following way. When 55 = 2, it will choose Xi = 2 for all i, resulting 
in a makespan of 1. When s^ = 1, A sets x = (6, 1, 0, 0, 3), resulting in a makespan of 3 (whereas 
the optimal is 1.5). Thus the average expected makespan achieved by A is 2. 

We note that, for this algorithm, the allocation curve for machine 5 is not monotone. Our 
monotonization procedure will therefore iron the valuation space of machine 5. The optimal iron- 
ing of this curve will draw s'^ uniformly from {1,2}. This causes 4 equally likely possibilities, 
corresponding to (55,55) G {1,2}^. 

If S5 = S5 then A is proceeding as though no ironing occurred: if 55 = S5 = 1 then the the 
makespan is 1, and if S5 = S5 = 2 the makespan is 3. Suppose S5 = 2, S5 = 1. Then A forms 
allocation x as though machine 5 has speed 1, though it actually has speed 2. Hence we obtain 
x = (6,1,0,0,3), for a makespan of 3. If, on the other hand, S5 = IjSg = 2 then we obtain 
X = (2, 2, 2, 2, 2), for a makespan of 2. 

We conclude that the expected makespan of the ironed procedure is ^+^+^+'^ = 2.25, which is 
strictly worse than the expected makespan obtained by the original algorithm. 

A. 2 Failure to Preserve Worst-Case Approximations 

We present an example to demonstrate that ideal ironing does not preserve worst-case approxima- 
tion ratios. Consider an auction of 2 objects to 2 unit-demand bidders, with the goal of optimizing 
social welfare. The private value of agent 1 is drawn uniformly from {1, 100}, and the private value 
of agent 2 is drawn uniformly from {10, 1000, 1001}. Let A be an approximation algorithm whose 
allocation rule is described in Figure [3j 

We note that ^ is a worst-case 11/10 approximation algorithm (where the optimal solution 
is to always allocate to both players). Also, A is not BIC for agent 1: £'[xi(l)] = 2/3, whereas 
i?[3;i(100)] = 1/3. The ideal monotonization procedure will draw a new bid v[ for agent 1 uniformly 
from {1, 100}, and run A on {v[, V2)- Call this new algorithm A. 
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Note that if vi = 100 and V2 = 10, then with probability 1/2 A will take v'l = 1 and choose allo- 
cation (0, 1), and with the remaining probability it will take v[ = 100 and choose allocation (1,0). 
Hence, for this set of input values, the expected welfare obtained by A is = 55. Since 110 is 

optimal, A is at best a 2-approximation algorithm, whereas A is an 11/10-approximation algorithm. 
We conclude that ideal ironing can cause a significant decrease in worst-case approximation ratios. 

A. 3 Ironing Allocation Rules vs. Ironing Virtual Valuations 

The ironing procedure described in this paper is reminiscent of the ironing procedure used by 
Myerson for maximizing revenue [12]. Myerson's optimal mechanism allocates to maximize virtual 
value (defined by (p{v) = v — ~jr^y')- When the virtual valuation functions are non-monotone, this 
mechanism first irons the virtual valuation functions, then allocates the maximize ironed virtual 
value. A natural question is whether this is identical to ironing the non-monotone allocation rule 
that would result from maximizing non-ironed virtual values. We show that these two mechanisms 
are distinct; therefore the approach of ironing allocations is suboptimal in terms of revenue. 

Consider the following distribution. With probability 1/2, the value is drawn uniformly from 
interval [10, 11]. With the remaining probability, the value is drawn uniformly from [11, 15]. This 
results in a virtual valuation function defined by 

|2„-12 110,11] 

[2v-15 ve (11,15]. 

The motivation behind this distribution is that (j){v) > for all v £ [10, 15], and the minimum of 
(j){v) does not occur at 10: 0(10) > 0(11 -|- e) for sufficiently small e. 

Consider the allocation rule specified by assigning the item to the agent with the highest (non- 
ironed) virtual value. By symmetry, all players' allocation curves are the same, say x(-). Note 
x(10) > 0, since there is a non-zero probability that all other players have virtual values less than 
0(10). 

Next consider the allocation to maximize ironed virtual valuations. Let 0' denote the ironed 
virtual valuation function. Then 0' is monotone, and it can be verified that 0'(1O) < (p'{v) for all 
V > 10. Hence, if x'{-) is the allocation curve for this mechanism, we have x'(lO) = 0. 

Finally, consider the allocation rule x"{-) that would result from ironing the allocation rule x(-) 
as suggested by Definition 13.11 Since x(10) > and x{v) > for all v G [10, 15], it must be that 
x"(10) > as it is a probability weighted average of some interval containing 10. 

Thus, since x'(lO) = / x"(10), we conclude that x' 7^ x" and hence these two different ironing 
procedures result in different mechanisms. 

A. 4 Recursive Ironing does not Guarantee ex post Incentive Compatibility 

In order for a mechanism to guarantee ex-post incentive compatibility, it must be that, for all i G [n], 
the allocation rule Xi is monotone for any choice ofv-i. Monotonizing each agent's allocation rule 
independently is insufficient to obtain this goal. Indeed, it is easy to construct examples where 
each each agent's allocation curve is monotone in expectation, but non- monotone for a particular 
choice of the other agents' bids. 

One might imagine the following recursive approach for obtaining ex-post incentive compati- 
bility. Begin by assuming that each agents' input is drawn from the singleton interval /j = [vj,?;^], 
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Figure 4: The allocation rule for algorithm A. The vertical axis corresponds to possible values vi, 
the horizontal axis corresponds to possible values V2, and the table entries denote the probability 
of allocation to both agents. For example, if (fi,i'2) = (1,4), then (xi,X2) = (0.22,0.22). 
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Figure 5: The results of the recursive monotonization procedure for algorithm A, on input (1,5), 
(a) after 1 step, (b) after 2 steps. 



and let I denote the cube /i x /2 x • • • x Choose some agent i whose allocation curve is not 
monotone, under the assumption that each other agents' values are drawn from cube I, and iron 
that agent's curve under this assumption. This ironing process may enlarge the interval from which 
agent i's value is drawn; update /j to be this new interval. Repeat this process, choosing a new 
agent on each iteration, until all agents' curves are monotone. 

Unfortunately, as we now demonstrate, the above procedure fails to guarantee ex-post incentive 
compatibility. Consider an auction setting with 2 agents, such that either both agents receive an 
allocation or neither does. Suppose A is the algorithm with allocation rule described in Figure HI 

Consider the application of our recursive monotonization technique on this algorithm when 
(fi,f2) = (1,5). Suppose we choose to monotonize the allocation curve for agent 2. Applying our 
monotonization procedure to the values (0.8,0.2,0.82,0.22,0.84,0.24), we obtain ironed intervals 
{1,2}, {3,4}, and {5,6}. The resulting allocation rule is shown in figure[5^a). 

We next monotonize curve xi at the point V2 = 5. This curve is non-monotone for agent 1, 
and the resulting ironed interval is {1,2}. The resulting allocation rule is shown in figure [5^b). 
After this monotonization, the allocation curves for all agents are monotone. The final expected 
allocation probability for both players is 0.47 (the entry at {vi,V2) = (1,5)). 

Next consider the application of this technique when (^1,^2) = (2,5). Monotonizing agent 2 
first, we apply our procedure to values (0.2,0.6,0.6,0.2,0.2,0.6) and we obtained ironed interval 
{2, 3, 4, 5}. The resulting allocation rule is shown in figure El^a). 

We next monotonize curve xi at the point V2 = 5. This curve is non-monotone for agent 1, and 
the resulting ironed interval is {1,2}. The resulting allocation rule is shown in figure [6]^b). After 
this monotonization, the allocation curve for agent 2 is no longer monotone, so we must iron again 
over the interval {1, 2, 3, 4, 5, 6}. At this point all agents' allocation curves are monotone. The final 
expected allocation probability for both players is 0.46. 

What we have shown is that, given ^2 = 5, our recursive monotonization procedure generates 
the allocation rule xi(l) = 0.47 > 0.46 = xi(2) for agent 1. This procedure therefore does not 
result in a monotone allocation rule, and hence does not obtain ex-post incentive compatibility. 
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ure 6: The results of the recursive monotonization procedure for algorithm A, on input (2,5), 
after 1 step, (b) after 2 steps, (c) after 3 steps. 
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